# High Reward Performance
Sac Walker2d V3
This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.
Physics Model
S
sb3
43
0
Td3 HalfCheetah V3
This is a TD3 reinforcement learning agent trained using the stable-baselines3 library, specifically designed for the HalfCheetah-v3 environment, achieving an average reward of 9709.01.
Physics Model
T
sb3
23
0
Featured Recommended AI Models